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Abstract — We survey recent generalizations and improvements 
of the linear programming method that involve semidefinite 
programming. A general framework using group representations 
and tools from graph theory is provided. 

I. Introduction 

The celebrated linear programming method was until re- 
cently the most powerful method to obtain estimates for 
extremal problems in coding theory. Initially developed by 
Philippe Delsarte in the early seventies in the framework of 
association schemes |[T2l . it was proved equally suitable for 
two-point homogeneous spaces [10, Chapter 9]. We recall 
that a metric space X with distance dx(x,y) is said to be 
two-point homogeneous if the group of transformations of X 
preserving dx, which will be called the automorphism group 
of X and denoted Aut(X), acts distance transitively on X 2 . 
The binary Hamming space H n = {0, 1}™ is a core example 
of these spaces, and the fundamental number A(n, d), the 
maximal number of elements of a binary code of length n 
with minimal Hamming distance at least equal to d, can be 
efficiently upper bounded with Delsarte's method. It has lead 
not only to numerical bounds, but also to explicit bounds 1 19 1, 
and to the best known asymptotic bounds [22] . Moreover this 
method has been equally successful for the other two-point 
homogeneous spaces including real spaces such as the unit 
sphere of Euclidean space lH4l . fP71 . 

Despite of these great results, the search for improvements 
and for generalizations of this method have been fundamental 
issues in coding theory. In recent years, results in these 
directions have been obtained using semidefinite programming 
instead of linear programming. Semidefinite programming 
(SDP for short) is a subfield of convex optimization con- 
cerned with the optimization of a linear functional over the 
intersection of the cone of positive semidefinite matrices with 
an affine subspace. It contains linear programming (LP) as 
a special case. It is a field of recent and growing interest 
because on one hand it goes with efficient algorithms, and 
on the other hand it is capable of modeling or approximating 
many optimization problems, in particular in the area of 
combinatorial optimization. 

The first step was taken by A. Schrijver in [26 1 who, 
using SDP, was able to improve the known upper bounds 
for A(n,d) for some parameters (n,d). In [15] these results 



where extended to the g-Hamming space, and in [16] they 
are further improved. The idea underlying these results is 
to exploit constraints involving triples ll26l . flBI . or even 
quadruples of codewords fl6l . 

Generalizing the linear programming method to other metric 
spaces [X, dx) is of special interest in view of the variety of 
spaces that play a role in recent areas of coding theory, such 
as codes over rings, network coding, and space time coding. 
The LP method was successfully generalized to some of these 
spaces (e.g. the non binary Johnson space 11271 . permutation 
codes [28 1, the Grassmann spaces [1], [24], the ordered codes 
lETl . Q, the unitary codes ifTTI ). essentially because the 
underlying spaces are symmetric spaces. Here we mean that 
the group of automorphisms exchanges any two elements of 
the space. However, other spaces do not fit into this framework, 
such as the projective space over a finite field or a ball in the 
Hamming space. 

One can go from H„ to another metric space (X, dx) and 
ask for estimates for the analogous number A(X, d). One can 
also stick to the Hamming space but consider other types of 
constraints, such as constraints involving fc-tuples of elements 
instead of pairs. To be more precise, we want to consider 
on the Hamming space H n , some functions f(x%, . . . , Xk) 
taking non negative values, defined on fc-tuples, that generalize 
the Hamming distance. Such a function is called a pseudo- 
distance if it satisfies two properties: is is invariant by a 
permutation of the Xi and it is invariant by the diagonal action 
of Aut(H n ). Several such pseudo-distances have been studied 
in coding theory, for example the generalized Hamming dis- 
tance introduced in [9|. Then, one can ask for Ak-i(n, f, m), 
the maximal number of elements of a binary code C such 
that f(xi, . . . , Xk) is at least equal to some value m when 
(xi, . . . , Xk) runs over the set of fc-tuples of pairwise distinct 
elements of C. It can be noticed that Ai(n, d, m) = A(n, m). 
We show in [4] that Schrijver's method l26l can be used to 
derive upper bounds for A^in, f, m) and in [5| that it can be 
generalized to fc > 3. 

Our aim in this paper is to give a general framework 
for the problems discussed above, based on a combination 
of tools from graph theory, and from the theory of group 
representations. Indeed, these situations can be interpreted 
in terms of the independence number of specific graphs (or 
hypergraphs); on the other hand, an upper bound for the 



independence number of a graph, which is the optimal value 
of a SDR, was discovered by L. Lovasz [20| who called it 
the theta number of the graph. In order to exploit this upper 
bound in coding theory, it is necessary to exploit the action of 
the automorphism group of the underlying space, and this step 
requires tools from group representations. We shall illustrate 
these ideas with the cases of projective space, Hamming balls 
and generalized Hamming distance. In this paper we restrict 
ourselves to finite spaces, although the ideas and results extend 
almost straightforwardly to the case of compact spaces (see 
0, 11). 

The paper is organized as follows: Section II reviews 
Delsarte's LP method for binary codes. Section III discusses 
Lovasz theta number of a graph. Section IV introduces 
semidefinite programs and their symmetrization. Section V 
links the theta number and the LP bound in the case of 
binary codes, and introduces the notions relative to a general 
space X. Section VI gives the necessary results from group 
representations. Section VII develops applications to codes in 
Hamming balls and to codes in the projective spaces. Section 
VIII discusses the applications to pseudo-distances. 

II. Delsarte's LP method for binary codes 

We take the following notations: for x an element of the 
binary Hamming space H n := {0,1}™, the Hamming weight 
wt(x) is the number of its non zero coordinates and the 
Hamming distance of a pair (x,y) 6 H 2 equals dji(x,y) = 
wt(x — y). A certain family of orthogonal polynomials, the 
Krawtchouk polynomials KJ}(t), are naturally attached to the 
Hamming space, and satisfy the so-called positivity property: 

ForallCcff n , £ K%(d H (x, y)) > 0. (1) 

This property lies at the root of Delsarte LP method. Let us 
introduce the distance distribution (xi)o<i< n of a code C: 

x% ■■= \{{x,y) e C 2 : d H (x,y)=i}\. 

Then, these numbers satisfy the following inequalities: 

1) For all < k < n, YTi=o K k(i) x ^ > °- 

2) Xl > 

3) x = l 

4) £^o** = |C| 

where 1) rephrases ([T). Moreover, if rfjj(C) > d, then Xi = 
for i = 1, . . ,,d — 1. From these inequalities, one obtains a 
linear program in real variables t/j, the optimal value of which 
upper bounds the number A(n, d) lfl2l : 

max { E"= Vi '■ °) 

yo = i, (2) 

Ui = if i = l,...,d-l ' 
E£=o#fc(i)»i>0 0<fc<n} 

In view of generalizations of this method, the role of the 
Krawtchouk polynomials should be clarified. In fact, these 
polynomials come into play because they are closely related to 



the irreducible decomposition of the space C(H n ) of complex 
valued functions on H n : 

C(H n ) := {/ : H n C} 

under the action of the group Aut(JJ n ) of transformations of 
H n preserving the Hamming distance. This group, of order 
2™n!, combines swaps of and 1 with permutations of the 
coordinates. 

More precisely, if Xz(x) '■= (— l) x ' z denote the characters 
of (F™ , +), we have: 

C{H n ) = ® ze H n C Xz = ® r k l =o p k 

where := @ w t{z)=k^Xz are Aut(l? n ) -irreducible sub- 
spaces, and the Krawtchouk polynomials can be defined by 

Kl{d H {x,y)) := Xz{x) X z{y) 

wt(z)=/c 

which in turn lead to the explicit expression: 

*m-h-vQ (:_-;). 

III. Lovasz' s theta number 

Let r = (V 1 E) be a finite graph. An independent set S of T 
is a subset of V such that no pair of vertices in S is connected 
by an edge, in other words S 2 PI E = 0. The independence 
number a(T) is then the maximal number of elements of an 
independent set. 

The number A(n, d) studied in coding theory can be in- 
terpreted as the independence number of a particular graph, 
i.e. the graph T(n,d) with vertex set V — H n and edge 
set E = {(x,y) 6 H 2 : < d H (x,y) < d}. So the 
methods developed in graph theory in order to estimate the 
independence number can be applied. It turns out that the exact 
determination of this graph invariant is a hard problem, but that 
a relaxation was defined by L. Lovasz l20l under the name 
of the theta number, which is computable with polynomial 
complexity in the size of the graph. More precisely, Lovasz 
theta number i?(T) is the optimal value of a semidefinite 
program: 

§(T) = max { Bi,j : B - i/),,!, ,., ,. B^O 

Li B i,i = 1) 

5^ = (i,j)eE) 

where the matrix B is indexed by the vertex set V = 
{1, . . . ,v} and where B >z stands for: B is a symmetric, 
positive semidefinite matrix. The celebrated Sandwich theorem 
is proved in l20l : 

Theorem 1: If x(r) denotes the chromatic number of the 
complementary graph T, then 

a(T) < i?(T) < X (T). 

Proof: We only prove the first inequality. Let S be an 
independent set and let I5 denote its characteristic function. 
The matrix 

B z , r .= -^l s (t)ls(j) 



is feasible for the program i?(r) and moreover its optimal 

value B i,j is e q ual to \ s \- So \ s \ < ^( r )- ■ 
However, a straightforward application of the inequality 
a(r) < &(r) in the case of binary codes would not be 
satisfactory because the graph T(n, d) has 2™ vertices, thus 
its size grows exponentially with the dimension n. The key 
to reduce the complexity of the computation of l d(T{n,d)) 
is to exploit the action of Aut(i? ra ) on this graph. In this 
process, up to a minor modification (i.e. one should consider 
$'(T(n, d)), in which the constraint that B takes non negative 
values is added), $'(r(n, d)) turn to be equal to Delsarte linear 
programming bound (see Section V and [25]). 

The situation described above is in fact very general. In 
coding theory, the spaces of interest are always huge spaces, 
but also have a huge group of automorphisms. Thus symmetry 
reduction will play a crucial role in the design of upper bounds 
of z9 type for extremal problems. 

IV. Semidefinite programs 

A semidefinite program (SDP for short) is an optimization 
problem of the form: 

7 := min { c\X\ H V c m x m : 

-A + x x Ai H h x m A m y 0} 

where (ci,...,c m ) G R m , Aq, . . . , A m are real symmetric 
matrices, and the minimum is taken over (x\, . . . , x m ) G R m . 
Linear programs correspond to the special case of Ai being di- 
agonal matrices. The above primal program has an associated 
dual program, defined below, where (A, B) = Trace(A_B*) is 
the standard inner product of matrices: 

7* := max{ (A ,Z) : Z^O, 

(Ai, Z) = Ci, i = l,...,m}. 

Weak duality, i.e. 7* < 7, always holds. Under some mild 
conditions, one has also strong duality, i.e. 7 = 7*. In this 
case, interior point methods lead to algorithms that allow to 
approximate 7 to an arbitrary precision in polynomial time. 
Moreover free solvers are available, e.g. on the web site NEOS 



Let G be a group of permutations of {1, ...,r}. It acts 
on matrices of size r by: (aA)ij — A^-i^^-i^, a G G. 
The SDP 7* is said to be G-invariant if the matrices Ai are 
of size r, if the set {Z : Z >z 0, (Ai,Z) — c.;} of feasible 
solutions is globally invariant by G, and if aAo = Aq for all 
a G G. In this case, if Z is a feasible solution, then another 
feasible solution Z' with the same optimal value and which 
is moreover invariant by G is obtained by an average of Z on 
G, i.e. setting 

1 1 oeo 

This reasoning shows that, if 7* is G-invariant, one can restrict 
the feasible solutions to be G-invariant. In other words, 



r 



-{ (A Q ,Z) 



ZhO, 

aZ = Z for a G G, 
{Ai,Z) = Ci }. 



Going from 7* to (7*)° is referred to as symmetry reduction 
or symmetrization of the SDP 7*. 

V. Back to the roots 

We come back to d'(n,d), which is Aut(ff„) -invariant. 
Its symmetrization involves thus the matrices B indexed by 
H n , which are positive semidefinite, and Aut(iJ„)-invariant. It 
turns out that there is a beautiful description of these matrices 
with help of the Krawtchouk polynomials. We now adopt a 
functional notation for matrices, i.e. we write B(x,y) instead 
°f Bx,y 

Theorem 2: B G C(H 2 ) is positive semidefinite and G- 
invariant if and only if 



d 



Kl(d H {x,y)) with a k > 0, < k < d. 



This result shows that the condition B >z can be replaced 
by the non negativity of the variables a k - Replacing in 1?', one 
obtains a linear program in the variables (do, . . . ,a n ). With 
a little bit of transformations, one can show that it is equal to 
Delsarte linear program. 

Now we consider following the same line for a metric space 
(X, dx ) with group of automorphisms G. With the obvious 
graph T(X,d), we have similarly 



A(X, d) < tf'(T(X,d)) c 



(3) 



Then we need a description of the G-invariant positive 
definite functions F G C(X 2 ), i.e such that the matrix 
(F(x,y))( Xt y) £ x 2 is (Hermitian) positive semidefinite. This 
description can be obtained using harmonic analysis of G, 
and is explained in next section. 

VI. Tools from harmonic analysis 

We shall be rather sketchy here and refer to [6 | for details. In 
J6|, the more general case of compact groups is considered. 
The space C(X) is a G-module for the action (gf)(x) := 
f{g~ 1 x) thus can be decomposed in irreducible submodules. 
So we have 

C{X) = 1 R™ 1 _L • • • _L 

where the subspaces R k are pairwise non isomorphic and 
G-irreducible. Then, for all k = 0, . . . , s, one can define a 
G-invariant matrix Ek(x,y), of size rn^, associated to the 
isotypic subspace R™ k , such that we have: 

Theorem 3: F G C(X 2 ) is positive definite and G-invariant, 
if and only if 



F(x, y) = Y,{Fk,E k (x, y)) with F k h 0. 



(4) 



fc=0 



Moreover, since E k (x,y) is G-invariant, its coefficients only 
depend on the orbits Oq(x, y) of pairs {x, y) G X 2 under the 
action of G, i.e. we have 

E k (x,y) = Y k (0 G (x,y)) 



for some matrix Y k . It remains to explicitly compute this 
matrix, which is a non trivial task in general. Special cases 
will be worked out in the next section. Replacing F y by 
the expression in $'(r(X, d)) then leads to a semidefinite 
program in the "variables" F k y 0. Here we can see exactly 
when this SDP turns to be an LP: since the matrices Fk have 
size rrik, it corresponds to the cases when m k = 1 for all 
< k < s. One can show that it is so if X is a symmetric 
space as defined in the Introduction. 

VII. Bounds for codes in Hamming balls and in 
Projective geometry 

The projective space X = V q>n over F q , the set of all 
linear subspaces of F™, is a metric space for the distance 
dx(x,y) :— dim(x) + dim(y) — 2dim(a; n y). Its automor- 
phism group is the group G = Gl n (F g ) of invertible linear 
transformations. The codes of this space have found recent 
applications in network coding lfl8l . Its action on X is not 
transitive; there are n + 1 orbits, the subsets Xk of subspaces 
of fixed dimension k, < k < n. The sets Xk themselves 
are two-point homogeneous, and Delsarte in IfPJI , who calls 
them q-Johnson spaces, has shown that they can be seen as 
q-analogs of the Johnson spaces, i.e. the sets of binary words 
with fixed weight. This analogy in fact extends to the pairs 
(X, G) when X is the full projective space over F g and G 
is the linear group G1„(F 9 ), respectively the Hamming space 
and the symmetric group S n . We discuss these situations in a 
uniform way, with the notations of ([5). 



X 




H n 


q 




1 


G 


G1„(F 9 ) 


Sn 


\x\ 


dim(a;) 


wt(x) 



(5) 



G splits X into the orbits X k : 

X k :={xeX : \x\ = k} 
while the orbits of X 2 are: 

X a ,b,c ■= {(x, y) e X 2 : \x\ = a, \y\ =b,\xHy\ = c}. 
The distance on these spaces also has a common expression: 

d x (x,y) = \x\ + \y\-2\xny\. 

In IPT3I . the G-decomposition of the spaces C(X k ) and the 
associated polynomials are determined (since the spaces are 
two-point homogeneous, the multiplicities m k are equal to 1). 
They belong to the family of q-Hahn polynomials. From these 
results one can go one step further and infer the computation 
of the matrices Y k for the space X (El): 

Theorem 4: The space C(X) contains 1 + [n/2\ isotypic 
subspaces indexed by < k < [n/2\, with multiplicities 
rrifc = n — 2k + 1, corresponding to irreducible spaces Rk 
of dimension hk- The coefficients of the associated matrices 
Ek(x,y) are explicitly given by the formulas: 
rj— fc] rn-2fcn 

E kiiJ (x,y) = \X\h k li - k l} j - kl qW-VQ k (n,i,r,i-\xny\) 

[j\ LiJ 



where k <i < j <n~k, \x\ — i, \y\ — j, Ek,ij(x,y) = if 
\x\ ^ i or \y\ ^ j, and Q k (n,i,j;t) are q-Hahn polynomials 
with parameters n, 

As an application, it is possible to derive from (01 upper 
bounds for A(J q ^ n ,d) and for A(B n (w),d) where B n (w) is 
the Hamming ball of radius w centered at 0: 

B n (w) :— {x G H n : wt(a;) < w}. 

Some numerical results are displayed in Table I. The stars 
indicate optimal bounds, attained by the intersection of the 
Golay code with B n (w). 
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4095* 



TABLE I 
SDP BOUNDS FOR A(B n (w), 8) 



VIII. Bounds for binary codes related to 

PSEUDO-DISTANCES 

We begin with the introduction of three pseudo-distances 
that have been studied in coding theory. For (xi, . . . ,x k ) G 
the generalized Hamming distance d(xi, . . . ,x k ) is de- 
fined by: 

d(x u ...,x k ) = \{j, l<j<n: x^{O k ,l k }}\. 

where x^ := ((xi)j, . . . , (x k )j) denotes the j-th column of 
the array: 

x x = 0... 01... 1100... 
x 2 =0...01...1011...0 

x k =0...01...1 001^___1 

d(x 1 ,...,x k ) 

This notion was introduced in [9|, and takes its origin in 
the work of Ozarow and Wyner, and of Wei, who studied 
the generalized Hamming weight of linear codes in view of 
cryptographic applications. When k — 2, d(x\,X2) is nothing 
else than the usual Hamming distance. 

The radial distance has connections with the notion of 
list decoding (|8|). The radial distance r(xi, . . . , x k ) is by 
definition the smallest radius of a Hamming ball containing 
the points x\, . . . , x k : 

r(xi,. ,.,Xk) = min { max d(y,Xi)\. 

Because this parameter is difficult to analyse, it is sometimes 
studied jointly with the average radial distance (|8|) 

r(xi,...,x k ) :=min{- d(y,Xi)}. 

V l<i<k 



We want to define an upper bound for the number 
Ak-i(n, /, m) relative to a pseudo-distance /, that resembles 
Lovasz's theta number. In view of the proof of the inequality 
a(r) < #(r) of Theorem Q] it is natural to consider the 
function 

Xc{zi, . . ■ , Zk) ~ j-L lc(zi) ■ • ■ lc(zk) (6) 

associated to a binary code C, and to work out a semidefinite 
program from its properties. With this line of thought, we 
obtain in the simplest form: 

Theorem 5: |[5) The optimal value of the following SDP is 
an upper bound of Ak-i(n, /, m): 

F satisfies (1) - (4)} 

where: 

(1) F(zi,...,z k )=F({z 1 ,...,z k }) 

(2) (x,y)^F{x,y,z 3 ,...,z k )h0wd >0 

(3) F{z x , . ..,z k ) = Oif f(zi, ...,z k ) < m-l and z { ± zj 

(4) E xe H n F(x) = l 

A slightly stronger condition is used in (5) instead of (2). 
In order to compute effectively with this program, it is again 
necessary to reduce it with Aut(iJ„), which amounts to ex- 
press the Aut(_ff„) -invariant functions F satisfying condition 
(2). This step can be completed with an analysis of the 
positive definite functions on H n which are invariant under 
the stabilizer of k — 2 elements (2:3, . . . , z^). The case k = 3 
corresponds to the stabilizer of one element, which can be 
chosen to be the zero word, thus to the group S„, so this case 
is contained in Theorem|4] The resulting symmetrized program 
for k — 3 coincides with the program used in [26 1 (with of 
course a change in condition (3)). In (4j, numerical bounds 
for k = 2 and for the three pseudo-distances defined above 
are computed and compared to the previous known bounds. It 
turns out that in almost every case the SDP bound is better. In 
0, numerical results are obtained for k — 4, i.e. for quadruple 
functions. However, it seems difficult to consider larger values 
of k, because the size of the resulting SDP is of order of 
magnitude n? _1 , 

Remark 6: There is also a graphic view point on Theorem 
[5] Indeed, the semidefinite program that we have defined, 
upper bounds the independence number of an hypergraph if 
H„ is replaced by its vertex set and if condition (3) is replaced 
by: F(zi, . . . , Zk) — if {z\, . . . , z^} is an hyperedge of the 
hypergraph. 

Remark 7: The semidefinite bound presented in lfl6l . in- 
volves functions F defined on the set Sk{H n ) of subsets of 
H n of cardinality at most k. The semidefinite constraints on 
F are as follows: for all S C Sk(H n ), 

(X,Y) ^F(Xl)Y) hO 

where X, Y run over the elements of Sk{H n ), containing S, 
and of size at most (fc + \S\)/2 (so that \X U Y\ < k). The 
case I = k — 2 corresponds to condition (2). 



The authors obtain with k = 4 new upper bounds for 
A(n, d) for sixteen values of (n, d) in the range 18 < n < 26 
and 6 < d < 12. Remarkably, the new bound in the case 
(n, d) — (20, 8) reaches the lower bound provided by succes- 
sive shortening of the Golay code, thus proves A(20, 8) = 256. 
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